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ABSTRACT 

Radio resource sharing mechanisms are key to ensuring good per- 
formance in wireless networks. In their seminal paper 1 1 1, Tassiu- 
las and Ephremides introduced the Maximum Weighted Scheduling 
algorithm, and proved its throughput-optimality. Since then, there 
have been extensive research efforts to devise distributed imple- 
mentations of this algorithm. Recently, distributed adaptive CSMA 
scheduling schemes |2| have been proposed and shown to be op- 
timal, without the need of message passing among transmitters. 
However their analysis relies on the assumption that interference 
can be accurately modelled by a simple interference graph. In this 
paper, we consider the more realistic and challenging SINR inter- 
ference model. We present the first distributed scheduling algo- 
rithms that ( i) are optimal under the SINR interference model, and 
( ii) that do not require any message passing. They are based on 
a combination of a simple and efficient power allocation strategy 
referred to as Power Packing and randomization techniques. We 
first devise algorithms that are rate-optimal in the sense that they 
perform as well as the best centralized scheduling schemes in sce- 
narios where each transmitter is aware of the rate at which it should 
send packets to the corresponding receiver. We then extend these 
algorithms so that they reach throughput-optimality. 

1. INTRODUCTION 

The throughput experienced on a given link in wireless networks 
is affected by the interference generated by the transmitters of other 
links. Interference management constitutes the main issue in the 
design of simple and efficient resource allocation (or Multiple Ac- 
cess Control) algorithms for such networks. Solving this issue be- 
comes even more challenging when links have to share radio re- 
sources in a distributed manner. Distributed power control (3][4) is 
often used (e.g. in cellular systems) to tackle this issue. How- 
ever, when links strongly interfere each other, power control is 
inefficient as the set of rates that can be simultaneously achieved 
on the competing links exhibits non-convexities. For such sce- 
narios, scheduling transmissions over time is much more efficient 
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and results in a much larger rate region. Most existing MAC algo- 
rithms for WLANs, Mesh, and AdHoc networks are scheduling al- 
gorithms: transmitters only decide when to be active, and when ac- 
tive, they use a single power level, often the maximum power level. 
In their seminal paper 1 1 ], Tassiulas and Ephremides proposed the 
queue-length based Maximum Weighted Scheduling (MWS) algo- 
rithm, and proved its throughput-optimality (meaning that it can 
stabilize the network whenever this is at all possible). However 
the MWS algorithm is centralized, and often requires to repeatedly 
solve instances of NP-hard optimization problems. 

Over the last two decades, there have been important research ef- 
forts towards the design of low-complexity and distributed versions 
of the MWS algorithm (refer to the related work section for refer- 
ences). Recently, in [2 5-7], simple and throughput-optimal adap- 
tive versions of CSMA have been proposed. These algorithms en- 
joy the property of being fully distributed, in the sense that they do 
not require any kind of message passing among the various trans- 
mitters. However their analysis and performance guarantees rely on 
the strong assumption that interference can be modelled as a sim- 
ple undirected graph (in the interference graph, vertices represent 
links, and an edge between two links mean that these links cannot 
be simultaneously activated). In particular, this simplistic interfer- 
ence model cannot account for the well-known hidden and exposed 
terminal problems, and more generally does not accurately capture 
the very nature of interference. In this paper, we revisit the design 
of efficient and distributed MAC protocols under the more realis- 
tic SINR interference model. Specifically, we aim at answering the 
following question: 

Can we devise fully distributed and optimal scheduling algo- 
rithms for wireless networks under the SINR interference model? 

By fully distributed, we mean that transmitters are not allowed to 
exchange any signalling message, and the only feedback available 
at a given transmitter is the level of interference measured at the 
corresponding receiver (just as in classical distributed power con- 
trol mechanisms |3 4|). Optimal may have several meanings. To 
discuss the different versions of optimality, let us first introduce the 
notion of rate region defined as the set of rates that can be simul- 
taneously achieved on the various links using some (centralized) 
scheduling algorithms, (i) Rate-optimality: in this case, transmit- 
ters always have packets to send, i.e., they are fully backlogged. An 
algorithm is rate-optimal, if it can achieve any rate vector within the 
rate region, (ii) Throughput-optimality: in this case, each transmit- 
ter receives, in its (infinite) buffer, packets arriving according to a 
stationary ergodic process with fixed average rate. An algorithm is 
throughput-optimal if it stabilize^] all buffers as long as the mean 
arrival rate vector belongs to the largest open set contained in the 
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rate region. 

In this paper, we show that surprisingly, the answer to the above 
question is positive, and develop fully distributed and rate-optimal 
scheduling algorithms. We also demonstrate how these algorithms 
can be used towards the design of throughput-optimal scheduling 
schemes. In the proposed framework, we first divide time into 
frames consisting of a fixed number of slots. Each transmitter is 
then allowed to adapt the power levels used in the various slots of 
a frame to achieve the rate it is targeting. Our solution is based 
on a simple power control mechanism, referred to as Power Pack- 
ing (PP). Under this mechanism, each transmitter aims at achiev- 
ing its target rate while minimizing the number of slots actually 
used, hence leaving as many radio resources as possible to the other 
transmitters. PP algorithms are shown to be rate-optimal when two 
links compete for the use of resources. However, in more general 
networks and in some rare scenarios, they may fail at achieving 
certain rate vectors that could have been realized using centralized 
scheduling. By just adding to the algorithms some level of random- 
ization in the power allocation, we overcome this issue and recover 
rate-optimality. All the proposed algorithms are simple and do not 
require any message passing: each transmitter adapts its power lev- 
els in the various slots depending on the observed interference lev- 
els. To our knowledge, the proposed algorithms constitute the first 
scheduling schemes that are fully distributed (no message passing) 
and optimal under the SINR interference model. 

The paper is organized as follows: 

(i) In Section 2, we present a brief overview of the existing liter- 
ature on distributed resource allocation algorithms in wireless net- 
works. 

(ii) In Sections 3 and 4, we present our generic framework, Power 
Packing algorithms and explain their rationale. 

(iii) We establish the rate-optimality of Iterative Power Packing al- 
gorithms for 2-link networks in Section 5. 

(iv) For more general networks, we explain, in Section 6, why It- 
erative Power Packing algorithms may in some rare cases fail. To 
solve this issue, we introduce some Perturbed versions of Iterative 
PP algorithms and show their rate-optimality. 

(v) In Section 7, we show how our rate-optimal algorithms can be 
adapted to achieve throughput-optimality. 

vi) Finally, in Section 8, we illustrate the efficiency of our algo- 
rithms using numerical experiments. 

2. RELATED WORK 

There have been, over the last two decades, a tremendous re- 
search effort towards the design of distributed resource sharing mech- 
anisms in wireless networks under various interference models (see 
e.g. surveys (8]|9|). For the simplistic interference graph model, 
researchers have developed scheduling algorithms that implement 
the celebrated throughput-optimal MWS algorithm [T] in a dis- 
tributed manner. Some of these algorithms use message passing, 
see e.g. [10], some others do not require message passing, e.g. as 
the adaptive versions of CSMA, see e.g. (2][5}|7). 

In this paper, we are interested in the more realistic SINR in- 
terference model. This model has also attracted a lot of attention 
recently, see e.g. |1 1|{14| . For example in 1 1 2| . the authors derive 
utility-optimal power control schemes, but the achieved rate region 
is restricted to that achieved by power control only. In |13||14| , 
the authors design schemes also enabling time sharing, and hence 
scheduling. These schemes implement the MWS algorithm, but re- 
quire message passing (basically, a transmitter need to know its im- 
pact on the throughputs on other links). In a series of papers |15f - 
|17| , Bambos et al. design power control algorithms that ressem- 
ble Foschini-Miljanic algorithm [3,4] in the sense that the power 



update at a transmitter only depends on the measured interference 
level, and on some local queue size. These schemes are fully dis- 
tributed, and seem to realize time sharing when needed. However, 
their optimality has not been established, and there may be network 
examples where these schemes are not optimal. 

3. MODELS AND PRELIMINARIES 

3.1 Network model 

We consider a network consisting of N interfering links (transmitter- 
receiver pairs). We are primarily interested in the design of rate- 
optimal algorithms, and so each link i has a target rate requirement 
R\ (corresponding to the QoS requirements of the underlying ap- 
plication). To achieve this target rate, link-i transmitter may adapt 
its transmission power pi. The transmission power at any trans- 
mitter cannot exceed P ma x. Links interfere, and we assume here 
that each receiver treats interference as noise. Let gji denote the 
channel gain from link-j transmitter to link-i receiver. Thermal 
noise is Gaussian, with power No. Under these assumptions, the 
maximum rate that link i can achieve can be written as: ri(p) = 

f ( N a +Y^ iiPi g ■ P ) ' wnere /(') i s an increasing positive concave 
function, typically f(x) = log(l + x), andp = (pi, . . . ,Pn). 

Notation. Let Wbea subset of . We denote by conv(W) the 
convex hull of U, and by dhi the Pareto-boundaty of U: x G dU 
iff x G U and Vy G U, y > x coordinate-wise implies that x = y. 
We further define U = {r G K+ : 3P G U,Wi, n < Ri} as the 
smallest coordinate-convex set containing U. 1 = (1, . . . , 1). 

3.2 Power control vs. Scheduling 

We define 1Z\ C = {r(p) : Vi,p; G [0,P max ]} as the set of 
vectors representing rates that can be achieved on the various links 
using power control. This set is known to be non-convex, and may 
exhibit different types of shapes, depending on the values of gains 
(gij, i,j), i.e., on the network geometry. Let Si = {r(p) : Vi,p; G 
{0, Pmax}} be the set of vectors representing link rates achieved 
using binary power control, i.e., for any i, link-i transmitter either 
remains silent or transmits at maximum power P max . Si is referred 
to as the set of schedules. The set 7J. schcd of link rates that can be 
achieved by switching schedules over time is the convex hull of Si : 
7?. sc ed = conv(Si). Now we may allow transmitters to use both 
power control and time sharing. In this case, the set of achievable 
rate vectors is 1Z — conv(7?.5 >c ). In general, both power control 
and time sharing are required, in the sense that we may have for 
the same network: K\ c C 71 and K schcd c TZ. We illustrate these 
observations in Figure [T] where we depict the Pareto-boundaries 
dTl V i and dK Bchcd of the set TL\ C and ^ sched , respectively, for 
different interference scenarios. When links strongly interfere each 
other, time sharing (scheduling) is enough, whereas when interfer- 
ence becomes weaker, power control may be necessary. In this 
paper, our goal is to design fully distributed algorithms enabling 
the various links to reach their target rates R l — (R\, . . . , R%), 
provided that P* G P. schcd . 

3.3 Multi-slot systems 

To share radio resources among links, we divide time into frames. 
Each frames consists of a fixed number M of time slots of equal du- 
rations. If each transmitter is allowed to use different power levels 
on the various slots, the rates R(p) achieved on the various links 
can be written as: 
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Figure 1: Rate regions under power control and scheduling: 
strong (left) and weak (middle and right) interference cases. 



where p = (p irn ,i = 1, ■ ■ ■ , N, m = 1, . . . , M) and p im is the 
power level used by link-i transmitter on the m-th slot in each 
frame. The set of achievable rates using such multi-slot power con- 
trol is then: TZ p A c t = {R(p) : Vi,Vm,p lm £ [0,P max ]}. TZ p A j can 
also be expressed as combinations of rate vectors in TZ pc : TZ^j = 
{r : 3s m £ TZ\ c ,m = 1, . . . , M : r = -jj Y^, m =i s '™}- Observe 
that we do not impose any constraint on the total power used by a 
transmitter per frame. 

Now consider scenarios where transmitters are allowed, in a given 
slot, either to use maximum power P max or to remain silent. As 
earlier, we may define a set Sm of schedules: 

S M = {R(p) : Vi,Vm,p iim £ {0,P max }}. 

Sharing time among the various schedules in Sm increases the set 
of achievable rates, i.e., Si C 5a/. Observe that the set of achiev- 
able rates on the various links using a single schedule in Sm is Sm 
is the smallest coordinate convex set containing Sm- The various 
notions of rate regions and their Pareto-boundaries are illustrated 
in Figure[2] 

Note that designing distributed radio resource allocation schemes 
achieving any P* £ 7^ schcd [ s difficult for this requires to identify 
the various proportions of time schedules in Sa/ are used. Design- 
ing schemes achieving any R £ Sm may seem easier because in 
this case we only need to identify a single schedule in Sm satisfy- 
ing the rate requirements. 

As stated in the following lemma, when the number of slots per 
frame is large, we can achieve the largest rate region 1Z by just 
implementing power control per slot, and 7£ schod by choosing a 
fixed schedule from 5a/- All proofs are presented in appendix. 

Lemma 3.1. linu/^oo Wfi = TZ, liniM-^ S M = 7l schcd . 

Here limM->oo Am = B means that for every point R £ B, 
there exists a sequence of points (Xm, M > 1) such that Xm £ 
Am for all M, and limA/->oo Xm = R. 

In practice, we observe that the introduction of frames, even of 
small sizes, considerably increases the rate region: in other words, 
the sequence of sets Sm, M = 1,2, ... rapidly approaches 7Z SC ed . 
Based on this observation and on previous lemma, we use the fol- 
lowing strategy to design distributed resource allocation schemes 
approximately achieving rates in 7?. schcd : (i) We select a frame 
size M so that 5a/ provides a good approximation of 7Z Bchcd , e.g. 
M — 16; (ii) we devise distributed resource allocation schemes 
achieving any rate vector in Sm- 

4. POWER PACKING 




Figure 2: Rate regions under power control (top-right, top-left) 
and scheduling (bottom-right, bottom-left) in multi-slot sys- 
tems. For scheduling, the black dots correspond to the Pareto- 
boundary of 54. The network for figures on the left (resp. right) 
corresponds to gains gtj = l,Vi, j (resp. gn — 2000, 512 = 0.4, 

<722 = 0.6, £21 = 0.4), No = 0.1, P m ax = 1. 

In this section, we present power packing algorithms for the 
multi-slot systems introduced in the previous section. When exe- 
cuting such algorithm, a transmitter aims at minimizing the number 
of slots actually used (a slot is used on a link, if the corresponding 
transmitter selects a strictly positive power level in this slot) while 
achieving the target rate. To run power packing algorithms, trans- 
mitters just need to measure the interference generated by other 
transmitters in the slots composing a frame. 

4.1 Algorithms 

Let Ii m (p) denote the interference perceived at link-i receiver 
during the m-th slot of the frame, given the power allocation p = 
(jp) = No + djiPjrn. We also introduce hi : 

[0,P max ] M xRf ^R+that gives the rate on link i as a function 
of link-i transmitter power levels, and perceived interference levels 

in the various slots: hi(jpi,h) = i £m=i / ( Pi }"l" ) ' 

4.1.1 Power Packing ( PP) algorithm 

Power packing algorithm is executed by a transmitter in response 
to the observed interference levels in the various slots of a frame. 
The principle of power packing is to sequentially fill with power 
slots in increasing order of perceived interference and until the tar- 
get rate is reached. If the latter cannot be reached, the transmitter 
just remains silent in all slots. The algorithm, whose pseudo-code 
is presented below, is illustrated in Figure[3] 

PP algorithm. (Executed at link-i transmitter) 

Input: target rate R\, interference levels Ii = (P m ) m . 

1. Compute the rate Ri = foi(P max l, p) achieved using maxi- 
mum power in each slot, 

2. If Ri < R*: select power allocation pi = (0, . . . , 0); 
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Figure 3: Example of power allocation obtained after power 
packing algorithm - frame size M — 4. 



3. If Ri > R\: order slots in increasing interference levels: 
let a be a permutation of {1, . . . , M} such that ij CT n) < 
• • -JiaiM)- Define 



i m P 
= mm{m : — > ) > Ri} 

M f— f ii£r(fc) 



Select the unique power allocation such that: Vm < m, 

P«r(m) = -Pmax,Vm > m, p i(T ( m ) = 0, and P ■ = hi(pi,Ii). 



4.1.2 Binary Power Packing (BPP) algorithm 

The PP algorithm has a binary version, where the transmitter is 
allowed in a given slot to either use full power P m ax or remain 
silent. BPP algorithm is identical to PP algorithm, except for step 
3 where the power allocation differs: If Ri > R\, the transmitter 
uses the power allocation p i<y{m) = P m axl{m<m}- 

4.2 Game theoretical interpretation 

We now provide a game theoretical perspective on PP and BPP 
algorithms. Consider a noncooperative game played by the N trans- 
mitters. Each transmitter competes rationally against the others by 
selecting a power allocation across the M available slots. The set 
of strategies available to any transmitter consists of all possible 
power allocation across slots. In the case where transmitters can 
use any power level between and P m ax, the set of strategies is 
V — {pi '.Win, pi m £ [0, P m ax]}, whereas in case of binary power 
control, this set reduces to Vb = {pi : Vm,j), m 6 {0,P ma x}}- 
The utility function Ui(p) of transmitter i is defined as follows: 



EM pi rt 
m=l T~ 



(P) 



where C is a positive 



Ui(p) = Cx l {Ri(p) > R t } 

constant such that C > Ylm=i PmaxflM/Wo, for any link i. We 
denote by Q(R t ) (resp. £b(P*)) the same defined above when the 
set of strategies is V (resp. Vb)- It can be easily shown that with 
our choice of utility functions, the PP and the BPP algorithms exe- 
cuted by link-i transmitter can be interpreted as the best response to 
the power allocations p~i — (pjm)j^i : m used by the other trans- 
mitters. In other words, assume that the power allocation pj used 
by link-j transmitter is fixed for all j ^ i. These allocations re- 
sult in interference levels (p m ) m at link-i receiver. For example, 
the power allocation obtained when link-i transmitter executes PP 
algorithm under these conditions solves the following optimization 
problem: maximize Ui(qi,p-i), over qi £ V. 

5. TWO LINK CASE: ITERATIVE POWER 
PACKING 

In this section, we restrict our attention to two-link networks. We 
propose and analyze the convergence of Iterative Power Packing 
(IPP) algorithms. The latter consist in letting transmitters sequen- 
tially update their power allocation using PP or BPP algorithms. 



5. 1 IPP and IBPP algorithms 

To define IPP and IBPP algorithms, we first introduce a sequence 
s — (s[t])t>i, s[t] £ {1, 2}, defining the order in which transmit- 
ters update their power allocation. We assume that the sequence 
satisfies the following property, stating that each transmitter gets to 
update its power allocation an infinite number of times: 

(PI) Vt > 1, 3ti, t 2 > t s[ti] = 1 ands[t 2 ] = 2. 

This property is referred to as liveness property in game theory. A 
sequence of updates satisfying this property is in principle easy to 
generate in a distributed manner, for example using independent 
Poisson clocks with identical rate at the various transmitters. Refer 
to §6. 1 for more details. We are now ready to define IPP algorithm: 

IPP algorithm. 

Input: target rate vector R t , update sequence s, initial power allo- 
cation p[0]. 

For each step t > 1: Let i = s[t]. 

1. Link-i transmitter measures interference levels Ii(p[t—1]) = 
{Iim(p[t — l])m in the different slots; 

2. Link-i transmitter runs PP algorithms with inputs R\ and 
h(p[t-l]). 



IPP algorithm has a binary version, IBPP algorithm, obtained 
by just replacing PP algorithm by the BPP algorithm in the above 
pseudo-code. IPP and IBPP algorithms correspond to the best re- 
sponse dynamics or Nash dynamics of the games Q (R) and Qb{R 1 ), 
respectively. They can easily be implemented in a fully distributed 
manner: when a transmitter updates its power allocation, it only 
needs to measure interference levels on the various slots and to 
know its own target rate. 

5.2 Convergence 

To study the convergence of IPP and IBPP algorithms, we in- 
troduce the notion of repulsive power allocation. We say that p = 
{pi , P2) £ [0, P m ax] 2M is repulsive if and only if there exist a per- 
mutation a of {1, ... , M} and two integers mi, m-z G {0, 1, . . . , M+ 
1} such that for all m 6 {1, . . • , M} (i)m < mi implies pi a ( m ) = 
Pmax, and m > mi + 1 implies p\ a i m \ = 0, (ii) m > m 2 implies 

P2a(m) = Pnax and TTl < TTll - 1 implies TJ 2 <T(m) = 0. 

The set of rate vectors that can be achieved using repulsive power 
allocation is then defined as: 

VJm P = {R £ R+ : 3p repulsive: R = R(p)} 

In the case the power allocation is binary, we similarly define: 

TIm PP = {Eel+:3pG {0, Pmax} 2M repulsive: R < R{p)} 

In what follows, we show that TZ IP i P (resp. TZ]^ PP ) is the rate 
region achieved under IPP (resp. IBPP) algorithm. 

THEOREM 5.1. Let R* e TZm P (resp. g K lPPP ). Then from 
any initial power allocation, IPP (resp. IBPP) algorithm con- 
verges to a repulsive power allocation p £ [0,P ma x] 2M (resp. 
P e {0, Pmax} 2M J such that P* = R(p) (resp. R f < R(p)). 

From a game theoretical perspective, this result states that if 
R* £ V} pp ', then the Nash dynamics converge to a pure Nash Equi- 
librium corresponding to a repulsive power allocation and achiev- 
ing the target rates P'. FigureUlillustrates the rate regions 7?.^ 
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Figure 4: Rate regions obtained through IPP algorithm. 



We do not represent 7£™ pp , because in fact, 7£™ pp = Sm- 
This can be shown by applying the following argument: Let p be 
a binary power allocation; modify this allocation such that (i) the 
number of slots used by each transmitter is not changed, and (ii) 
the new allocation is repulsive. It is easy to see that the new al- 
location provides greater rates to all transmitters, notice however 
that TZm P is smaller than IZ^: this is true when the initial rate 
region 1Z\ C has concave parts - see Fig. El (right). In this case, 
some points of the Pareto-boundary of TZ'ff can only be achieved 
by non-repulsive power allocations of the type p = (xl, -Pmax 1 ) i 
where x € [0, P max ]. 

To conclude this section, one can show (as in Lemma 1) that 
TZm P and 7l IPPP approximate TZ Bchcd (when M is large). 
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As a consequence, any target rate vector R* inside ~JZ Bchcd can 
be achieved either using IPP or IBPP algorithm, provided that the 
frame size is large enough. In other words, IPP or IBPP algo- 
rithms are approximately rate-optimal in 2-link networks (we will 
give a more precise definition of what we mean by "approximately 
rate=optimal" in the next section). 

6. MULTIPLE LINK CASE: ITERATIVE PER- 
TURBED POWER PACKING 

In this section, we consider general networks with more than two 
links. We first explain why IPP or Binary-IPP may fail at converg- 



ing for some specific target rates in 1Z S ' 



We then present two 



binary power control algorithms to overcome this issue. 

An example of networks and target rates where IPP does not 
work is as follows. Consider a network consisting of 3 links shar- 
ing the same receiver (Access Point scenario), and let M = 3 
slots. The two first transmitters are close to the receiver, whereas 
the third one is further away. Assume that the target rates can be 
achieved by the unique following power allocation: pi = p2 — 
(Pmax, Pmax, 0) and p 3 = (0, 0, P max )- This happens for exam- 



pleifP| = §/(S— 



:).» = 1.2 



v ,, and Rl > f( N P " 1 ^ 933 , 
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(in words, the third link cannot accomodate any kind of interfer- 
ence). Now the problem stems from the fact that if transmitters 1 
and 2 select their allocation using PP first, then they would pick 
pi = (Pmax, Pmax, 0), p 2 = (0, 0, P m ax). The third transmitter 
on the other hand needs to be alone in a slot to be satisfied (i.e., to 
achieve its target rate), but it cannot, and hence remains silent. The 
issue with IPP is actually common to all distributed power control 
protocols. A transmitter that causes low interference to others, but 
that is strongly interfered by others, has difficulties indicating its 
state to others through power control. Similarly, a link that cannot 



suffer much from interference has difficulties in gauging the impact 
of its power allocation on other links. 

The proposed solution to this problem marries Power Packing 
principle and randomization. PP is used to (quickly) reach a fea- 
sible power allocation, when PP can indeed go there. We believe 
that in most cases PP actually finds a feasible allocation. Random- 
ization only helps IPP algorithm when the latter cannot converge to 
the desired allocation. Thus, the proposed schemes can be thought 
of as the perturbed version of IPP algorithm. 

6.1 Iterative Perturbed Binary PP algorithm 

The key idea of the algorithm is to force transmitters that are 
satisfied but whose power allocation is not compatible with any 
globally feasible allocation to explore other power allocations. This 
exploration is here triggered when unsatisfied transmitters create 
enough interference so that the target rate of satisfied links cannot 
be achieved anymore. The algorithm works as follows. First we 
generate a sequence of transmitters selected to update their power 
allocation over the frame. Then the power updates satisfy rules that 
we describe below. 

Update sequences. As for IPP and IBPP algorithms, transmitters 
update their power allocation sequentially. The sequence of updates 
is driven by s — [s\t])t>o, assumed here to satisfy the following 
property: 

(P2) (s[t])t>o is a stationary ergodic Markov chain with state space 
{1, . . . , AT}, such that P[s[t] = i] > for all transmitter i. 

A sequence satisfying (P2) may be generated when updates are trig- 
gered by independent Poisson clocks of identical rates, say v, at the 
various transmitters. To be more specific, when the clock of a trans- 
mitter ticks, the latter starts a power update at the next frame. When 
the common clock rate is relatively low (compared to the inverse of 
the frame duration), it is very unlikely that updates at two trans- 
mitters overlap (the convergence of our algorithms holds even in 
case of unfrequent update overlapping - in fact convergence takes 
a finite number of updates, and so we just need that such sequence 
of updates occurs with positive probability). Under the above sce- 
nario, observe that for each new update, the selected transmitter 
is selected uniformly at random, so that (P2) is satisfied. Note also 
that the time between updates occur at instants of a Poisson process 
of mean rate Nv. 

Updating rules. When an unsatisfied transmitter is picked for a 
possible update, it picks a power allocation as per the Binary-PP 
algorithm with probability (w.p.) (1 — ai), and picks a random 
allocation w.p. ai. A random power allocation can be obtained by 
using power P ma x on each slot w.p. 1/2 independently of the power 
levels used in other slots. When a satisfied transmitter i is selected, 
it checks whether its target rate has been achieved because of its 
own power allocation decision in the past (/3j = 1) or because of 
changes in the power allocation by other transmitters (f3i = 0). It 
can for example happen that an other transmitter decided to remain 
silent (applying Binary-PP algorithm), which made i satisfied. In 
this case, the power allocation used by i might not be compatible 
with any globally feasible allocation, and transmitter i should ex- 
plore other allocations. Thus in the algorithm, when /9j = 0, i does 
not update its power allocation w.p. (1 — 0:2), and chooses a random 
power allocation w.p. 0:2- Parameters ai and Q2 characterize the 
level of randomization in the algorithm. In what follows, we always 
assume that < cti, 0:2 < 1. When they are small, the algorithm is 
close to the initial IBPP algorithm, and converges very fast to a fea- 
sible allocation if IBPP can find one, but the algorithm would then 
take more time to identify a feasible allocation that IBPP cannot 
reach. The pseudo-code of the algorithm is presented below. 



Iterative Perturbed Binary-PP (IPB-PP) algorithm. 

Input: target rate vector R\ update sequence s, power allocation 
p[0],/3[0] G {0,1} K . 

For each step t > 1: Let i = s[t]. 

1. Tx i measures interference levels h(p[t — 1]) in the different 
slots. 

2. Tx i updates its power allocation to pi [t] : 

(i) If Ri(p[t-1]) < Rj), then pi[t] is obtained as per BPP 
algorithm with inputs R\ and Ii(p[t — 1]) w.p. 1 — ai 
and is a random power allocation w.p. an; 

(ii) Else If {Pi [t - 1] = 0), pi [t] = P i [t - 1] w.p. (1 - a 2 ) 
and pi [t] is random w.p. c\2\ 

Elsepj[*] =P»[* - !]• 

4. Txisets/3i[t] = 1 if Ri( Pi [t],p-i[t- 1]) > flf, and/3^] = 
otherwise. 



We prove the convergence of IPB-PP under the following as- 
sumption. Let U{p) denote the set of unsatisfied links under binary 
power allocation p. 

(Al) For any power allocation p such that U(p) / and U (p) / 
{1, . . . , TV}, either there exists i G U(p) such that for 
^(-Pmax, P-i) > Rj, or for p' such that p^ 1 for all 

i G U(p) andp'i = pi for i $ U(p), U{p) C U(p'). 

The assumption states that for any given power allocation p, ei- 
ther there exists a unilateral change in the power allocation of an 
unsatisfied transmitter that makes it satisfied, or when unsatisfied 
transmitters all select P ma xl, at least one other link becomes un- 
satisfied. 

THEOREM 6.1. If there exists a binary power allocation p* such 
that R(p*) > R , if (Al) holds, then from any initial condition, 
IPB-PP algorithm converges almost surely to a power allocation p 
such that R(p) > i?*. 

The previous theorem does not lead to the rate-optimality of IPB- 
PP algorithm. Although the algorithm performs well in practice, 
there are still some target rate vectors that it cannot reach. This 
is typically the case where one link has very low target rate, in 
which case, assumption (Al) may not be satisfied (the correspond- 
ing transmitter can be hardly affected by interference). Next we 
propose a rate-optimal algorithm whose principles are similar to 
those of IPB-PP algorithm. 

6.2 Interference-Triggered algorithm 

The next algorithm follows the same design principles as IPB- 
PP algorithm. However, the way satisfied transmitters are forced to 
explore other power allocations is different: they explore new allo- 
cations if they perceive significant changes in interference. More 
precisely, exploration is triggered when the change in the sum of 
the interference measured in the various slots exceeds a threshold 
8. The pseudo-code of this new algorithm is presented below. 



Interference-Triggered IPB-PP (IT-IPB-PP) algorithm. 

Input: target rate vector R f , update sequence s, power allocation 
p[0], previous interference levels J last [0] G R+ . 

For each step t > 1: Let i = s[i\. 



1. Tx i measures interference levels Ii(p[t — 1]) in the different 
slots. 

2. Tx i updates its power allocation to Pi[t]: 

(i) If Ri(p[t-1}) < Rj), then pi [t] is obtained as per BPP 
algorithm with inputs R t and h(p\t — 1]) w.p. 1 — ai 
and is a random power allocation w.p. an; 

(ii) Else If |E™=r Wp[*-l])-4 ast [^l]l > S, Pi [t] = 
Pi[t — 1] w.p. (1 — Q2) and pi [i] is random w.p. 0-2; 

Elsepi[£] = Pi[t — 1]. 

4. Tx i sets l\^[t] = £* f =1 Iim(p[t - 1]). 



We prove the convergence of the algorithm under the following 
assumption on 8. 

(A2) For every set U 7^ or {1, ... , N}, there exists a set U' / 
satisfying (1) U H U' = and (2) for every j G U', 

Assumption (A2) states that any set U of transmitters can be 
"heard" by at least one link in U c = {1, . . . , iV} \ U. Note that as 
long as gij > for every i, j, for any 8 > 0, one can find a frame 
size M (large enough) such that (A2) is satisfied. In this sense, the 
assumption is not restrictive: one may choose S depending on the 
sensitivity of receivers, and then tune M so that (A2) holds. 

THEOREM 6.2. If there exists a binary power allocation p* such 
that R(p*) > R 1 , and if with our choices of 8 and M, (A2) holds, 
then from any initial condition, IT-PIB-PP algorithm converges al- 
most surely to a power allocation p such that R(p) > R ■ 

The above theorem states that if i?* G Sm and if (A2) holds, 
then IT-IPB-PP algorithm converges to a feasible power allocation. 
Now combining, this result with that of Lemma 3.1, we deduce that 
IT-PIB-PP is approximately rate-optimal. To be more precise, for 
e G (0, 1), we say here that an algorithm is e-rate-optimal if it can 
achieve any rate vector 7?* such that 7?* + el G 7£ schcd . 

COROLLARY 6.3. For any e > 0, and any threshold 8 > 0, 
there exsists a frame size M(e, 8) such that if M > M(e, 8), IT- 
IPB-PP algorithm is e-rate-optimal. 

7. THROUGHPUT- OPTIMALIT Y 

In the previous section, we developed an approximately rate- 
optimal and fully distributed scheduling scheme. We now turn our 
attention to scenarios where each transmitter is equipped with an 
infinite buffer where it stores packets before sending them, and we 
address the design of throughput-optimal and distributed schedul- 
ing algorithms. We first describe our assumptions on the arrival 
processes, and on the notion of system stability. 

Arrival processes. We assume that packets arrive in transmitter- 
i's buffer according to an i.i.d. process. Let Ai[t] denote the 
number of bits arriving in transmitter-i's buffer during frame t. 
(Ai[t])t>o forms an sequence of i.i.d. random variables such that 
Ai [t] < A < 00 for all i and t. The mean arrival rate (per frame) at 
transmitter i is denoted by Ai = E[Ai[i]]. Let A = (Ai, . . . , Ajv). 
Finally, we assume that arrival processes are independent across 
transmitters. 

Stability. Let Qi[t] denote the number of bits in transmitter-i's 
buffer at the beginning of frame t. It evolves as: Qi[t + 1] = 
max(0, Qi[t] + Ai[t] — Si[t]), where Si[t] is the number of bits 
sent during frame t. Let B denote the time required to empty all 



queues, i.e., B = inf{it : u > 0,Qi[u] — 0,Vi}. We say that 
the system is stable if E[B|Q[0]] < oo for all initial queue vector 
Q[0] = (Qi[0], ■ ■ -,Qn[0]) such that Qi[0] < oo.foralli We say 
that an algorithm is e-fhroughput optimal if it stabilizes the system 
whenever A + el G TZ schcd . 

We use IT-IPB-PP algorithm to design approximately throughput- 
optimal and fully distributed scheduling schemes. 

7.1 Known arrival rates 

If each transmitter i is aware of its arrival rate Ai, this design is 
straightforward: each transmitter i selects a target rate R\ slightly 
bigger than A;, and we then run the IT-IPB-PP algorithm with these 
target rates, even when its queue is empty (using dummy packets). 
Under this strategy, after convergence of the IT-IPB-PP algorithm 
(which occurs after a finite time with finite mean), queues behaves 
independently and each of them has an arrival rates strictly less than 
its fixed service rate, which ensures stability. Next we make these 
statements precise. 

LEMMA 7.1. Let M be a frame size such that IT-IPB-PP is 
(e/2)-rate-optimal. Assume that A — el G TZ Bchcd . Then under 
IT-IPB-PP algorithm with target rate vector R* = A + (e/2)l, the 
system is stable. 

The above lemma simply states that IT-IPB-PP algorithm pro- 
vides an e-throughput-optimal algorithm, if each transmitter knows 
its arrival rate, e can be made as small as desired by increasing the 
frame size M. 

7.2 Unknown arrival rates 

When the arrival rate A; is not known, transmitter i estimates it. 
When its estimate is precise enough, it selects a target rate appropri- 
ately (again slightly bigger than its estimated arrival rate) and then 
runs the IT-IPB-PP algorithm with this target rate. More precisely, 
for any i, let A»[t] = \ X^s=i Ai[s\ and let /i = e/8. Further de- 
fine the interval e k = [2(k — 2kfj). The target rate vector is 
continuously updated as follows: for any i, 

if \i[t] G e k , then R\[t] = (4k + 1)^/2. 

When Ai lies in the interior for some e k , since Xi[u] — > Ai a.s. 
as ti — > oo, after a finite time Ti, R\\t] does not change any- 
more. In appendix we briefly explain how the case Ai = 2k/j, 
can be handled. The following lemma then relies on the facts that 
R\ [Ti] G (Ai, Ai + e/2) and Epi] < oo (proved in appendix): 

LEMMA 7.2. Let M be a frame size such that IT-IPB-PP is 
(e/2)-rate-optimal. Assume that A— el G 7?. 8chcd , and that IT-IPB- 
PP algorithm is executed jointly with the above target rate update 
algorithm. Then the system is stable. 

According to the above lemma, the proposed joint target rate up- 
date and scheduling algorithm is e-throughput-optimal. It is worth 
remarking that this algorithm proceeds in three phases: in the first 
phase, each transmitter aims at identifying a target rate that is just 
strictly greater than the arrival rate of bits in its buffer; in the sec- 
ond phase, IT-IPB-PP algorithm finds a power allocation compat- 
ible with the target rate vector; and finally, transmitters apply this 
power allocation, and queues empty. Also note that our algorithm 
is not designed so as to adapt to changing traffic conditions (i.e., 
changes in the arrival rates). A way to devise adaptive algorithms 
would be to let each transmitter continuousloy updates its target 
rate, depending on its observed queue length. To study such queue 
length based algorithm, one would need to understand the interac- 
tion between dynamics of the queues and of our IT-IPB-PP algo- 
rithm, which would require a significantly more involved analysis. 



8. NUMERICAL EXPERIMENTS 

In this section, we present simulation results to illustrate the 
rate-optimality of IPB-PP and IT-IPB-PP algorithms. For all ex- 
periments, the sensitivity parameter 8 in IT-IPB-PP algorithm is 
fixed. We first experiment with a 3-link network. The network 
geometry is such that transmitters 1 and 2 strongly interfere link 
3, whereas transmitter 3 does not produce much interference, i.e., 
3i3 = 323 = 60, (731 = <?32 < 1, and the other gains are equal to 1. 
The target rate vector is chosen so that it cannot be reached by sim- 
ple iterative Power Packing. It corresponds to a power allocation 
close to pi = p 2 = (P m ax, -Pmax, 0), p 3 = (0, 0, P max ). Fig. [5] 
shows the convergence time (in number of updates) of IPB-PP and 
IT-IPB-PP algorithms as a function of the exploration rate ai (we 
choose Q2 = «i). The convergence time is averaged over 10,000 
simulations starting from random power allocations. The conver- 
gence time rapidly grows either when the exploration rate is close 
to 0, or when it becomes too large. In the former, the algorithms 
behave like Binary-IPP, and cannot find a feasible allocation. In 
the latter, the algorithms get closer to a random search algorithm, 
and the convergence time explodes. Hence, in IPB-PP and IT-IPB- 
PP algorithms, it is clear that both Power Packing and randomiza- 
tion components are crucial: PP accelerates the convergence and 
randomization helps where PP fails at identifying a feasible allo- 
cation. It is worth noting that when the target rate vector can be 
achieved through simple Power Packing (without randomization), 
the convergence of the algorithm is very fast. 



IPB-PP 
IT-IPB-PP 



0.2 0.4 0.6 0.8 

1 -(exploration rate) 

Figure 5: Convergence time vs Exploration rate ai - N = 3 = 

M 

Next we consider randomly generated networks by placing 10 
links on a 2D square (gains are computed using a path loss expo- 
nent equal to 3). For each generated network topology, we further 
generate 10 4 target rate vectors in 5a/. For each vector we analyze 
the convergence time if the latter remains less than 10 4 updates. 
We use two metrics for comparison: (1) the average (over topolo- 
gies and rate vectors) number of updates required for convergence, 
given that it remains less than 10 4 , and (2) the proportion of rate 
vectors for which the algorithm does not converge in less 10 4 up- 
dates. 

We first investigate the performance of our algorithms when the 
frame size M varies. Here we fix «i = a?2 = 0.1. Figure [6(a)] 
shows that as M increases, the proportion of rate vectors not reached 
by the algorithms rapidly decreases. For IT-IPB-PP, all vectors are 
achieved when M = 16, illustrating the rate-optimality of the al- 
gorithm. Note that this is not the case of IPB-PP, as we predicted. 
Figure [6(b)] shows how the convergence time varies with M. IT- 
IPB-PP seems to conevrge faster, and for both algorithms the con- 
vergence speed is increased when M grows large. 

We now challenge our algorithms, and evaluate their performance 
when the frame size is not sufficient to guarantee the rate-optimality 
of IT-IPB-PP: we fix M — 8, and vary the exploration rate ai. In 
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Figure 6: Performance of IPB-PP and IT-IPB-PP - «i = 0.1 

3000 
1 2500 

s; 2000 





0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 
Exploration rate (alpha! ) 



0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 
Exploration rale (alphal) 



(a) Avg. number of updates fofb) Prop, of non-achieved rate 
convergence vs. exploration ratevectors vs. exploration rate 

Figure 7: Performance of IPB-PP and IT-IPB-PP - M = 8 



Figure |7(a)| we observe that in this case, the convergence time in- 
creases when ai decreases, which again ilustrates the importance 
of the Power Packing component in the algorithms. In Figure |7(b")| 
the proportion of rate vectors not reached by the algorithms within 
10 4 updates seem to increase as cti increases, which indicates the 
negative effect of an aggressive random exploration. 

9. CONCLUSION 

This paper presents the first distributed scheduling algorithms 
that are optimal under the realistic SINR interference model, and 
do not require message passing among transmitters. The fact that 
algorithms combining such properties exist in surprising. Our solu- 
tion is based on combining a simple power allocation strategy, and 
randomization techniques. Without randomization, the power allo- 
cation scheme could not, alone, achieve all parts of the throughput 
region (although numerical experiments show that it reaches a vast 
majority of it), and hence randomization is needed. We actually 
believe that randomization is always needed, i.e., no deterministic 
resource allocation scheme can be optimal. It would be interesting 
to formally establish this result. We are also interested in studying 
the convergence time of our iterative power allocation scheme, and 
its impact on actual queueing delays. 
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APPENDIX 

Proof of Lemma 13.11 

We prove lirriivf->>oo TL V ^ — TZ. H-oim^ooSm = TZ Bchcd can be 
proved analogously. Let R £ 1Z. Since TZ = conv(7?.f c ) C R JV , 
by Caratheodory's theorem, there exist a finite set 



{n, 
Ai,. 



„N+1 



} of (N + 1) points in TZ f c and positive real number 



, Ajv+i such that: R = Y] 



N+l 



i A^and 



Ef^A^L 



For any j = 1, . . . , N + 1, let p> G [0, P ma . K ] N denote the 
vector representing power levels used by the various transmitters 
to achieve rate vector rj. Now for M > 1, we propose the fol- 
lowing power allocation across the M slots of a frame: for all 
j = 1, . . . , N + 1, power levels p 3 are used for [MXj\ slots 
(where [x\ is the largest integer smaller than or equal to x). The 
power allocation is arbitrary for the remaining slots. Using this 
power allocation, the achieved rate vector Um satisfies the follow- 
ing component-wise inequality: Um > X^^i 1 x r i' 

Note that liniAf^oo = -\j f° r all j> an d mus 

lirriA/^oo Um > P. □ 

Proof of Theorem 15.11 

We prove the convergence of IPP algorithm (the proof of the con- 
vergence of Binary-IPP is similar and easier). Let s be a sequence 
of updates satisfying property (PI). Without loss of generality, we 
can assume that s[2t] = 1 and s[2t + 1] = 2, for all t > (i.e., 
transmitters alternatively apply PP algorithm). We denote by p[t] 
the power allocation after the t-th update, and abuse the notation 
by writing R[t] = R(p[t]). It can be readily seen that (i) after 
both transmitters update once, i.e., for t > 2, the power alloca- 
tion p[t] is a-repulsive (repulsive under the slot permutation a), for 
a fixed given permutation a of slots; (ii) Pi[2i] G {0,Pi} and 
R 2 [2t - 1] G {0,P 2 } for any % > 1. Observation (i) can be 
easily proved by induction on t. Given the permutation a, we in- 
troduce the following notation: letpi,pi £ [0, P max ] M , we write 
Pi <a pi if for all m, pi CT(m) < p' 1(T(m) . 

Now let P f G Pm P - Let P* = (pi,P 2 ) t> e a cr-repulsive power 
allocation such that R(p*) = R l ■ We establish the convergence of 
IPP algorithm to P* by investigating various possible initial condi- 
tions. 

Case 1: At time 2t, pi[2t] = 0. This means that after an update, 
link-1 transmitter actually chooses to remain silent. Without loss of 
generality, we assume that t — 0. We show by induction property 
(N t ), stating that the sequence of power allocations is monotoni- 
cally increasing, and that the target rates are alternatively achieved 
on links 1 and 2: 

Property (Nt): pi[2t - 2] < CT Pl [2t] < CT p{, R x [2t] = R\, 
p 2 [2t - 3] < CT p 2 [2t - 1] < a p* 2 , R 2 [2t - 1] = Rt (with the 
convention that p 2 [—1] = 0). 

Let us prove (iVi). We have pi[0] = 0. Then at time 1, link-2 
transmitter applies PP algorithm, and selects allocation P2[l] such 
that -ffefl] = R\- Observe that link 2 has no interference, so that 
P2[l] < CT p 2 - Then link-1 transmitter updates its power allocation. 
Since P2[l] < CT P2, it can choose an allocation pi[2] such that 
Ri[2] = R\ andpi[l] < a p\. Thus (Ni) holds. Now assume that 
(N t -i) holds, and let us prove (iV t ). At time 2t, link-1 transmitter 
updates its power, and since p 2 [2t — 1] < CT p 2 , it can choose an 
allocation pi [2t] such that Ri [2t] = R\ andpi[2i] < CT p*. The 
same argument applied for link-2 transmitter allows to finish the 
proof of (N t ). 

Now p[t] is monotonically increasing (w.r.t. < CT ), and hence 
it converges. Remark that because of monotonicity, after a finite 
number of updates, the number of slots used by transmitter 1 or 
2 is fixed. Hence after these numbers are fixed, the transmitters 
just update power on a single slot (always the same). The updates 
correspond to the synchornous version of Foschini-Miljanic algo- 
rithm, and hence converge to a feasible solution. In other words, if 
limt^oo p[t] = p, then R(p') = R l . 

The case where at time 2t + l,p 2 [2t + 1] = 0, is similar to Case 

1. 

Case 2: At time 2t + 1, R 2 [2t + 1] = R\ and Ri[2t + 1] < R\. 



Without loss of generality, asume that t = 0. Link-1 transmitter 
updates its power at time 2. There are two cases: 

(i) -Ri(P m axl,P2[l]) < R\, in which case, pi[2] = 0, and we re- 
turn to Case 1 ; 

(ii) fll(P ma xl,P2[l]) > P'l, in which case, pi[2] is such that 
Pi[2] = Pi andpi[2] > a pi[0] (because Pi[l] < Pi[2]). Now 
we have P2[2] < R\ because interference increased for link 2. 
We can show using induction arguments just as those used in Case 
1 that the power allocation is monotonically increasing until one 
transmitter saturates and becomes silent. In the latter case, we are 
back to Case 1. If transmitters never reset their power, we have 
convergence towards the target rates (using the same argument as 
in Case 1). 

The case where at time 2t, Ri[2t] = R\ and R 2 [2t] < P|, is 
similar to Case 1 . 

Case 3: At time 2t + 1, R 2 [2t + 1] = R 2 and Pi[2t + 1] > Pi. 
In this case again, we can show convergence using monotonicty ar- 
guments exactly as in previous cases. Note that in this case, power 
allocations are monotonically decreasing. The case where at time 
2t, Pi [2t] = Pi and P 2 [2t] > R\ is of course similar. □ 

Proof of Theorem 16.11 

Since the update sequence satisfies (P2), 

x[t] = (s[t],p[£], /3[t]) t >o is an homogenous Markov chain with 
finite state space. Observe that a set {(s,p, 1), s G {1, . . . , A^}} 
constitutes a communication class of this Markov chain if P(p) = 
P (in such states, all links are satisfied, and do not update their al- 
locations anymore). To prove the theorem, we just need to show 
that from any initial state, at least one of these communication 
classes are accessible, i.e., we construct a finite sequence of state 
transitions occuring with positive probability and leading to one of 
the aforementioned communication classes. To construct such a 
path, we use the fact that from any state, all transmitters are picked 
for possible a power update with positive probability. We also use 
the fact that if tx i is chosen for an update, and either its target rate 
is not satisfied or its is equal to 0, then tx i can pick any power 
allocation with positive probability. 

W.l.o.g. we may assume that ?7(p[0]) 7^ and that there is no 
tx i G U (p[0]) that can update its allocation and become satisfied. 
Indeed if this is not the case, we pick this tx. With positive probail- 
ity, it updates its power allocation to P ma xl and becomes satisfied. 
We repeat this procedure: pick an unsatisifed tx that can become 
satisfied, and let it use P ma xl. From allocation p[0], power levels 
have been only increased, and so we end up at a state where there 
is no unsatisifed tx that can become satisified by unilateral power 
update. 

Our constructed path consists of phases, indexed by A; = 0, 1,2, .... 
At the beginning of phase k, the set of unsatisfied links whose trans- 
mitters is not using allocation P max l is denoted by Vk- Phase 
k consists in letting links from Vk select allocation P max l (this 
occurs with positive probability because these links are not statis- 
fied). Note first that Vq 7^ 0, for by assumption (Al), when tx from 
U(p[0]) use P max l, one satisfied link becomes unsatisfied. Such 
an update requires that at least a tx from £7(p[0]) is able to increase 
interference, and hence is not already using allocation P max l. We 
prove similarly that at the beginning of phase k > 1, either Vk 7^ 
or every transmitter uses P max l. Assume that Vk = 0, which 
means that all unsatisfied transmitters use P max l. Hence unsat- 
isfied transmitters cannot change their power allocation either to 
become satisfied or to disatisfy one link. From (Al), we deduce 
that all links are unsatisfied, and hence all use P max l. In summary 
after at most iV phases, all links are unsatisfied and use allocation 
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After all links have become unsatisfied, we add the following 
phase. We pick tx one after the other once. When tx i is picked, 
either it is unsatisfied, or due to power updates of previous trans- 
mitters, it has become satisfied, but the value of its parameter /3j is 

(because it was not picked earlier in this phase). Hence when tx 

1 is picked, it will update its power allocation. With positive prob- 
ability, it selects p* . After the last tx is picked, each tx is satisfied, 
but the /3i's may not be all equal to 1. Finally we add a last phase: 
only the tx's i such that fli = are picked, and they again select 
power allocation p* . Thus we constructed a positive probability 
path from any state to a state where every tx is satisfied and will 
not update its power again. □ 



Proof of Theorem 

The proof is similar to that of Theorem |6.1| 

y[t] = (s[£],p[t], i last [t])t>o is an homogeneous Markov chain 
with finite state space. A set {(s,p, I),s G {1, . . . , iV}} consti- 
tutes a communication class of this Markov chain if R(p) > 72* 
(in such states, all links are satisfied, and do not update their allo- 
cations anymore). We show that these classes are accessible, and 
from any state, we build a positive probability path towards one of 
these classes. 

Let y[0] b e an y initial state of the Markov chain. As in the proof 
of Theorem 6.1 w.l.o.g. we may assume that [7(p[0]) 7^ and 
that there is no tx i G U(p[0]) that can update its allocation and 
become satisfied. Let Uo = U(p[0]). By (A2), there exists a max- 
imal set Ui / such that Uo n Ui = and for every j G U', 
A7fmax Yli£u 9*3 ^ ^' ^ et ^ 1 ls max ' ma l m a sense that no set 
U' D U\ satisfies (A2) for the set Uo- Similarly, we recursively 
define U w as the maximal set satisfying (A2) for the set U™ =1 Ue 
it\Jf =1 U t / {1,...,N}. Let W: \jf =1 Ut = {1,...,N}. Note 
that such W exists and is less than or equal to N. Also note that 
the sets Ue's define a partition of {1, ... , N}. 

Our constructed path consists of phases, indexed by k = 0, 1, 2, ... 



We show that we can build these phases with positive probability 
such that: 

(i) In each phase, all tx's are selected once; tx's from Uo are se- 
lected first, then tx's from Ui, and so on. (ii) In phase k, the tx's 
not in Uk+i U . . . U Uw do not update their power allocation, (iii) 
In phase 2k, each tx i updating its allocation selects pi — 0. In 
phase 2k + 1, they select P max l. 

(iv) there is a phase that ends with all tx having power allocation 0. 
If this construction is valid, then from the state where all tx remain 
silent, we conclude as in the proof of Thoerem |6.1| we let each tx 
pick p*, and run two phases to align the variables 1^ *. 

We now justify (i)-(ii)-(iii)-(iv). (i)-(ii) are immediate (s satis- 
fies (P2), and a tx may always pick the same allocation as before 
with positive probability). Note that because of (i), in each phase, 
each tx i updates its value of /' ast , In phase 0, all tx's in Uo are 
unsatisfied, they update their power, and all choose with positive 
probability. At the beginning of phase 1, tx's in Uo are unsatisfied, 
and pick P max l; after that, from (A2), any tx i in Ui noticed the 
increased interference in its parameter J last , and hence update its 
power allocation with positive probability - it selects P max l. In 
phase 2, tx's in Uo are still not satisfied because from their perspec- 
tive, interference has increased compared to that perceived initially; 
they can then update their allocations again and this time select 
power. This will be noticed by tx's in Ui, that again will update 
their allocations and select power. In phase 3, tx's in Uo and Ui 
will select allocation P max l, which will be noticed by tx's U2. The 
latter will then select allocation P max l. Repeating this argument, 



we justify (iii). (iv) is readily deduced from (iii). □ 

Proof of Lemma 17.11 

Let T be the time at which the IT-IPB-PP has converged. Since 
T is the absorbing time of a finite state Markov chain, we have 
E[T] < oo. Now at T, a worst case (sample-path wise) is obtained 
by assuming that in each queue i, there are AT + Qi[0] bits to 
be served. From T, queues behave independently, and are also 
independent of the r.v. T. Thus the system is stable if and only 
if each queue is stable. It remains to prove that each queue i is 
stable. W.l.o.g., assume that at time 0, queue i has AT + Qi[0] 
bits to be served, and let Bi = inf{u : Qi(u) — 0}. Define 
A *M = 5 E*=l M a ]- Let S = R\ - Xi > 0. We have: 

¥[Bi >u]< V[AT + Qi[0] + u\i[u] > uR*] 

+ \i[u] - \ t > 5} 

> £]+P[A,[u]-A,>|] 
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where the last inequality is obtained using Hoeffding's inequality 
(ci,c 2 > 0). We deduce that E[B t ] = J2^Li ¥ l B ^ > u \ < °°> 
and queue i is stable. □ 



Proof of Lemma 17.21 

We just need to prove here that E[2}] < oo and R\ [Ti] G (A.;, Aj + 
e/2). After establishing these results, we can apply the same proof 
as that of Lemma [7T| Indeed, note that after maxiTi, at each 
transmitter, the target rate is fixed and greater than the arrival rate; 
also observe that E[max, Zi] < ^ E[Tj]. 

We only consider that Ai lies in the interior of for some k^ 
Thus, there exists 8 > such that ^-neighborhood of Ai lies in ek- 
Let Ts = inf{i : sup u>t |Aj[u] — A»| < 5}. Note that for every 
t > T s , Xi[t] £ e k and thus R\[t] = (4k + 1)^/2. Also observe 
that Ti <T S . We show that E[T S ] < oo. Consider ¥{T S > t} and 
note that 

¥{T S >t}= P{LC =m {|Ai[w] - A»| > 6}} 
< J2 P{|AiH-Ai| > 6} 
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The last inequality follows from Hoeffding's inequality (ci,C2 > 
0). Now, E[T S ] = Et=i F (^ > *} < Et=i t e " C2 ' < oo- 
Hence, E[Tj] < oo. Finally, from the fact that A; G and 
R\{Ti\ = (4k + l)/i/2, we simply deduce that R\[Ti\ G (Ai, A; + 
e/2). 



"The case where A,: may lie on the boundary of some can be 
handled similarly by choosing a slightly more complex target rate 
update algorithm: we consider two partitions of R + , (et)k>i and 
(/fc)fc>o where f k = e k + fi/2 for k > 1 and /„ = [0,/i/2). We 
consider the same rate update, but switch partition when A^ [t] falls 
into a different interval than that of A^ [t — 1]. Using this, after Ai [t] 
concentrates around Ai, we do not switch partition anymore, and 
Ai lies in the interior of an interval of the partition. 



